word attention
Observing Dialogue in Therapy: Categorizing and Forecasting Behavioral Codes
Cao, Jie, Tanana, Michael, Imel, Zac E., Poitras, Eric, Atkins, David C., Srikumar, Vivek
Automatically analyzing dialogue can help understand and guide behavior in domains such as counseling, where interactions are largely mediated by conversation. In this paper, we study modeling behavioral codes used to asses a psychotherapy treatment style called Motivational Interviewing (MI), which is effective for addressing substance abuse and related problems. Specifically, we address the problem of providing real-time guidance to therapists with a dialogue observer that (1) categorizes therapist and client MI behavioral codes and, (2) forecasts codes for upcoming utterances to help guide the conversation and potentially alert the therapist. For both tasks, we define neural network models that build upon recent successes in dialogue modeling. Our experiments demonstrate that our models can outperform several baselines for both tasks. We also report the results of a careful analysis that reveals the impact of the various network design tradeoffs for modeling therapy dialogue.
Different Flavors of Attention Networks for Argument Mining
Frau, Johanna (National University of Córdoba) | Teruel, Milagro (National University of Córdoba) | Alemany, Laura Alonso (National University of Córdoba) | Villata, Serena (Université Côte d'Azur)
Argument mining is a rising area of Natural Language Pro- cessing (NLP) concerned with the automatic recognition and interpretation of argument components and their relations. Neural models are by now mature technologies to be ex- ploited for automating the argument mining tasks, despite the issue of data sparseness. This could ease much of the man- ual effort involved in these tasks, taking into account hetero- geneous types of texts and topics. In this work, we evaluate different attention mechanisms applied over a state-of-the-art architecture for sequence labeling. We assess the impact of different flavors of attention in the task of argument compo- nent detection over two datasets: essays and legal domain. We show that attention not models the problem better but also supports interpretability.
Word Attention for Sequence to Sequence Text Understanding
Wu, Lijun (Sun Yat-sen University) | Tian, Fei (Microsoft Research) | Zhao, Li (Microsoft Research) | Lai, Jianhuang (Sun Yat-sen University) | Liu, Tie-Yan (Microsoft Research)
Attention mechanism has been a key component in Recurrent Neural Networks (RNNs) based sequence to sequence learning framework, which has been adopted in many text understanding tasks, such as neural machine translation and abstractive summarization. In these tasks, the attention mechanism models how important each part of the source sentence is to generate a target side word. To compute such importance scores, the attention mechanism summarizes the source side information in the encoder RNN hidden states (i.e., h_t), and then builds a context vector for a target side word upon a subsequence representation of the source sentence, since h_t actually summarizes the information of the subsequence containing the first t-th words in the source sentence. We in this paper, show that an additional attention mechanism called word attention, that builds itself upon word level representations, significantly enhances the performance of sequence to sequence learning. Our word attention can enrich the source side contextual representation by directly promoting the clean word level information in each step. Furthermore, we propose to use contextual gates to dynamically combine the subsequence level and word level contextual information. Experimental results on abstractive summarization and neural machine translation show that word attention significantly improve over strong baselines.